Image capturing device

ABSTRACT

An image capturing device/system is provided with attention directing functions for directing the attention of imaging targets. In particular, the image capturing device may include a presentation device, such as a visual or an audio presentation device. The presentation device may be provided on a side of the image capturing device facing the imaging targets when the image capturing device captures the images of the imaging targets. The presentation device may be configured to display or output a presentation before and/or during an image capturing process to direct the attention of the imaging targets toward the image capturing device.

BACKGROUND

Field of the Invention

The present invention generally relates to an image capturing device, and in particular, an image capturing device with presentation functions for directing the attention of imaging targets.

Related Art

A user of an image capturing device may use the image capturing device to capture images of various imaging targets including persons. When capturing images of persons, it is often preferable that the persons look in the direction of the image capturing device. However, the persons (e.g., children) may not be interested in looking in the direction of the image capturing device or may not know when to look at the image capturing device. As such, it may be difficult for the user of the image capturing device to capture satisfactory images of the persons. Thus, there is a need for an improved image capturing device that includes functions for directing the attention of imaging targets.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a diagram illustrating an image capturing device capturing images of an imaging target according to an embodiment.

FIG. 2A is a functional block diagram of an image capturing device suitable for implementing attention directing presentations for imaging targets according to an embodiment.

FIG. 2B is a perspective view of an image capturing portion of an image capturing device according to an embodiment.

FIG. 2C is a block diagram of a networked system including an image capturing device suitable for implementing attention directing presentations for imaging targets according to an embodiment.

FIG. 3 is a flowchart showing a process of setting up attention directing functions of an image capturing device according to an embodiment.

FIG. 4 is a flowchart showing a process for capturing images with attention directing functions according to one embodiment.

Embodiments of the present disclosure and their advantages are best understood by referring to the detailed description that follows. It should be appreciated that like reference numerals are used to identify like elements illustrated in one or more of the figures, wherein showings therein are for purposes of illustrating embodiments of the present disclosure and not for purposes of limiting the same.

DETAILED DESCRIPTION

According to an embodiment, an image capturing device/system is provided with attention directing functions for directing the attention of one or more imaging targets. In particular, the image capturing device may include a presentation device, such as a visual or an audio presentation device. The presentation device may be provided on a side of the image capturing device facing the imaging targets when the image capturing device captures the images of the imaging targets (as used herein, “imaging targets” includes one or more live targets, which can include persons or animals).

The presentation device may be configured to display or output a presentation before and/or during an image capturing process to direct the attention of the imaging targets towards the image capturing device. For example, the presentation device may display a video including audio output to attract and direct the attention of the imaging targets whose image is to be taken toward the image capturing portion of the image capturing device and provoke a desired expression if possible. The presentation may be displayed a few seconds before and/or during the image capturing process such that the imaging targets look at the image capturing device when the images are captured.

In an embodiment, the presentation may be customized based on the imaging targets whose image is to be captured by the image capturing device. For example, the image capturing device may detect and recognize, e.g., by facial or voice recognition, the person or persons whose image is to be taken. The image capturing device may determine and/or customize the presentation to be presented to better attract the attention of the imaging targets, including provoking a desired expression, such as laughter, smile, surprise, etc. For example, the image capturing device may present video or audio of cartoon characters to attract the attention of a child or children if a child or children are identified in the imaging targets. The presentation may include various types of images, animations, and/or audio based on the type, number, location, time, and/or context of the imaging targets. For example, if the imaging targets include multiple persons of a certain age, the presentation may include a countdown to the image capturing moment, such that the people in the group may know when to look at the image capturing device.
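The customization described above can be illustrated with a minimal sketch. It is not taken from the disclosure: the `Target` structure, the `choose_presentation` function, the presentation labels, and the age threshold for "child" are all assumptions introduced for illustration, and the recognition step (facial or voice recognition) is assumed to have already produced the list of targets.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Target:
    """A recognized imaging target (hypothetical structure)."""
    name: str
    age: int


# Assumed threshold; the description does not define when a person is a "child".
CHILD_AGE_LIMIT = 12


def choose_presentation(targets: List[Target]) -> str:
    """Pick a presentation type from the composition of the imaging targets."""
    if any(t.age < CHILD_AGE_LIMIT for t in targets):
        return "cartoon"    # cartoon video/audio when a child is identified
    if len(targets) > 1:
        return "countdown"  # a group gets a countdown to the capture moment
    return "default"        # no special customization applies
```

For instance, under these assumptions a single three-year-old target would yield "cartoon", while two adult targets would yield "countdown".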

In an embodiment, the image capturing device or system may determine the presentation to be presented to the imaging targets based on user (person operating device or imaging targets) history, social media, and/or context. For example, the image capturing device may collect a person's presentation preferences based on the person's online or media activities, including browsing history, app usage, location, social network profiles, personal profiles, and the like. The presentation preferences also may be determined by crowd sourcing, such as preferred presentations by other users of similar type, age, gender, demography, and the like.

In an embodiment, the presentation may be determined based on context, such as location, setting, group dynamic, and the like. The presentation may be determined or selected based on where and when the image capturing process is taking place. For example, if near a meal time, food related images may be used to attract the imaging target's attention.
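As a rough sketch of the meal-time example, the following checks whether the capture time falls inside a meal window and selects a food related theme if so. The meal windows, the function names, and the theme labels are assumptions made for illustration; the description does not specify how "near a meal time" is determined.

```python
from datetime import time

# Assumed meal windows (start, end); not specified in the description.
MEAL_WINDOWS = [
    (time(7, 0), time(9, 0)),     # breakfast
    (time(11, 30), time(13, 30)), # lunch
    (time(18, 0), time(20, 0)),   # dinner
]


def near_meal_time(now: time) -> bool:
    """Return True if the given time of day falls inside any meal window."""
    return any(start <= now <= end for start, end in MEAL_WINDOWS)


def context_theme(now: time, default: str = "generic") -> str:
    """Select a food related theme near meal times, else the default theme."""
    return "food" if near_meal_time(now) else default
```

Other context signals mentioned above, such as location or setting, could be added as further inputs in the same manner.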

In an embodiment, the presentation may be customized to attract attention of multiple persons. For example, if the imaging target includes a group of people, the presentation may be selected or determined based on the group's shared interests. In another example, the group's interest may be prioritized based on various characteristics. For instance, younger persons may have higher priority, because younger persons may have shorter attention spans compared with older persons.
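One possible way to combine the two group strategies above (shared interests first, then priority by age) is sketched below. The dictionary keys "age" and "interests" and the function name are hypothetical, and the tie-breaking rule is an arbitrary choice made only to keep the result deterministic.

```python
from typing import Dict, List, Optional


def pick_group_interest(group: List[Dict]) -> Optional[str]:
    """Choose one interest for a group of imaging targets.

    Each member is a dict with hypothetical keys "age" and "interests".
    """
    if not group:
        return None
    # First preference: an interest shared by every member of the group.
    common = set(group[0]["interests"])
    for member in group[1:]:
        common &= set(member["interests"])
    if common:
        return sorted(common)[0]  # alphabetical pick keeps the result deterministic
    # Otherwise prioritize by age: the youngest member with any interest wins.
    for member in sorted(group, key=lambda m: m["age"]):
        if member["interests"]:
            return member["interests"][0]
    return None
```

With this sketch, a child and a parent who both like dogs would get a dog themed presentation; if they share nothing, the child's first listed interest is used.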

In an embodiment, the image capturing device may detect and analyze the attention status and/or movement of the imaging targets and may determine the presentation in real time to capture the attention of the imaging targets, as needed. For example, the image capturing device may detect where the imaging target is looking and may determine and present an attention attracting presentation accordingly.
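A minimal geometric sketch of the "where the imaging target is looking" check follows. It assumes that some upstream gaze estimator (not described here) already supplies a gaze direction vector and a vector from the target toward the camera; the function name and the 15 degree tolerance are illustrative assumptions.

```python
import math
from typing import Sequence


def is_looking_at_camera(gaze: Sequence[float],
                         to_camera: Sequence[float],
                         max_angle_deg: float = 15.0) -> bool:
    """Return True if the gaze direction is within max_angle_deg of the camera.

    Both arguments are 3D direction vectors; the threshold is an assumed value.
    """
    dot = sum(g * c for g, c in zip(gaze, to_camera))
    norm = math.hypot(*gaze) * math.hypot(*to_camera)
    if norm == 0:
        return False  # a zero-length vector carries no direction
    # Clamp to guard against floating point drift outside [-1, 1].
    cos_angle = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_angle)) <= max_angle_deg
```

When this check returns False, the device could start or change the presentation; when it returns True, the device could proceed with the capture.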

FIG. 1 is a diagram illustrating an image capturing device capturing images of imaging targets according to an embodiment. In particular, a user 105 may operate an image capturing device 110 to capture images of an imaging target. As shown in FIG. 1, the imaging target may include a person or persons or an animal or animals. The user 105 may prefer that the person or persons in the imaging target look in the direction of the image capturing device 110 during an image capturing process, such that the faces of the persons may be captured by the image capturing device 110. In an embodiment, the image capturing device 110 may include a presentation device, such as a visual display or an audio output device, configured to output a presentation to direct the attention of the persons in the imaging target towards the image capturing device 110. The presentation device may output the presentation before or during the image capturing process to direct and/or retain the attention of the persons in the imaging target.

FIG. 2A is a functional block diagram of an image capturing device 110 suitable for implementing attention directing presentations for imaging targets according to an embodiment. In various implementations, the image capturing device 110 may comprise a personal computing device (e.g., smart phone, a computing tablet, a personal computer, laptop, wearable device, Bluetooth device, key FOB, badge, etc.) capable of communicating with an electronic communication network.

Image capturing device 110 includes a bus 202 or other communication mechanism for communicating data, signals, and information between various components of image capturing device 110. Components include an input/output (I/O) component 204 that processes a user action, such as selecting keys from a keypad/keyboard, selecting one or more buttons or links, user gestures detected on a touch screen, etc., and sends a corresponding signal to bus 202. I/O component 204 may also include an output component, such as a display 211 and a cursor control 213 (such as a keyboard, keypad, mouse, touch screen, etc.). An optional audio input/output component 205 may also be included to allow a user to use voice for inputting information by converting audio signals. Audio I/O component 205 may allow the user to hear audio.

A transceiver or network interface 206 transmits and receives signals between image capturing device 110 and other devices, such as another user device, an application server, or a service provider server via an electronic communication network. In one embodiment, the transmission is wireless, although other transmission mediums and methods may also be suitable. A processor 212, which can be a micro-controller, digital signal processor (DSP), or other processing component, processes these various signals, such as for display on image capturing device 110 or transmission to other devices via a communication link 218. Processor 212 may also control transmission of information, such as cookies or IP addresses, to other devices.

The image capturing device 110 may include a camera 215. The camera 215 may be configured to capture images. In particular, the camera 215 may be a digital camera including an image sensor, optical components such as mirrors, lenses, and the like, configured to direct light to form an image on the image sensor, a shutter configured to capture a scene, an aperture configured to adjust an amount of exposure, and the like. The image sensor may be one or more of a semiconductor Charge-Coupled Device (CCD), a complementary metal-oxide-semiconductor (CMOS) sensor, or the like. Images captured on the image sensor may be displayed on display 211 of the image capturing device 110. Thus, the display 211 may be used as a viewfinder. A viewfinder may allow a user to view the imaging target from which images are to be captured by the image capturing device 110. A flash may be provided to illuminate and/or provide additional lighting on the imaging target. The camera 215 may be controlled by the processor 212 to capture images. Images captured by the camera may be converted into digital signals/data and stored in a memory 214. The camera 215 may also have video capturing capabilities, e.g., having a video mode, as is common in smartphones, digital cameras, and computing tablets.

The image capturing device 110 may include a presentation device 217. The presentation device 217 may be configured to present or output audio, still images, and/or video to an imaging target, such as a person or persons (or a pet) whose image is to be captured (which includes both still images and video) by the image capturing device 110. The presentation may direct and/or retain the attention of the imaging target during the image capturing process. The presentation device 217 may include one or more visual output devices, such as one or more of a display screen, a lighting device, a visual indicator, or other devices configured to output visual presentations or signals, such as a video presentation, lighting patterns, and the like. For example, the presentation device 217 may include a Liquid-Crystal Display (LCD) screen configured to display images and/or videos. Lighting devices, such as Light Emitting Diodes (LEDs), may also be included and configured to display lighting patterns. In an embodiment, the presentation device 217 may include one or more audio output devices, such as audio speakers or other devices, configured to output audio presentation or signals, such as sound, music, alarms, and the like. In an embodiment, the presentation device 217 may include vibration functions configured to output a vibration or patterns of vibration to direct and/or retain the attention of the imaging target. The presentation device 217 may be positioned to face the imaging target when the camera 215 is capturing an image of the imaging target. The presentation device 217 may be controlled by the processor 212 to output presentations to the imaging target.

FIG. 2B is a perspective view of a front side of the image capturing device 110 according to an embodiment. The camera 215 and the presentation device 217, e.g., a display screen, may be arranged at a front side of the image capturing device 110. As such, during an image capturing process, the camera 215 and the presentation device 217 may both face the imaging target. The camera 215 may capture images or videos of the imaging target while the presentation device 217 may output presentations to the imaging target to direct and/or retain the attention of the imaging target. In an embodiment, the camera 215 may be surrounded by the presentation device 217, as shown in FIG. 2B. As such, the presentation device 217 may display an animation to direct attention toward the camera 215. In some embodiments, the presentation device 217 may output both visual and audio presentations to direct and/or retain the attention of the imaging target.

Components of image capturing device 110 also include a system memory component 214 (e.g., RAM), a static storage component 216 (e.g., ROM), and/or a disk drive 217. Image capturing device 110 performs specific operations by processor 212 and other components by executing one or more sequences of instructions contained in system memory component 214. Logic may be encoded in a computer readable medium, which may refer to any medium that participates in providing instructions to processor 212 for execution. Such a medium may take many forms, including but not limited to, non-volatile media, volatile media, and transmission media. In various implementations, non-volatile media includes optical or magnetic disks, volatile media includes dynamic memory, such as system memory component 214, and transmission media includes coaxial cables, copper wire, and fiber optics, including wires that comprise bus 202. In one embodiment, the logic is encoded in non-transitory computer readable medium. In one example, transmission media may take the form of acoustic or light waves, such as those generated during radio wave, optical, and infrared data communications.

Some common forms of computer readable media include, for example, floppy disk, flexible disk, hard disk, magnetic tape, any other magnetic medium, CD-ROM, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, RAM, PROM, EEPROM, FLASH-EEPROM, any other memory chip or cartridge, or any other medium from which a computer is adapted to read.

In various embodiments of the present disclosure, execution of instruction sequences to practice the present disclosure may be performed by image capturing device 110. In various other embodiments of the present disclosure, a plurality of computer systems coupled by communication link 218 to the network (e.g., a LAN, WLAN, PSTN, and/or various other wired or wireless networks, including telecommunications, mobile, and cellular phone networks) may perform instruction sequences to practice the present disclosure in coordination with one another.

Where applicable, various embodiments provided by the present disclosure may be implemented using hardware, software, or combinations of hardware and software. Also, where applicable, the various hardware components and/or software components set forth herein may be combined into composite components comprising software, hardware, and/or both without departing from the spirit of the present disclosure. Where applicable, the various hardware components and/or software components set forth herein may be separated into sub-components comprising software, hardware, or both without departing from the scope of the present disclosure. In addition, where applicable, it is contemplated that software components may be implemented as hardware components and vice-versa.

Software, in accordance with the present disclosure, such as program code and/or data, may be stored on one or more computer readable mediums. It is also contemplated that software identified herein may be implemented using one or more general purpose or specific purpose computers and/or computer systems, networked and/or otherwise. Where applicable, the ordering of various steps described herein may be changed, combined into composite steps, and/or separated into sub-steps to provide features described herein.

FIG. 2C is a block diagram of a networked system including an image capturing device 110 suitable for implementing attention directing presentations for imaging targets according to an embodiment. Networked system 100 may comprise or implement a plurality of servers and/or software components that operate to perform various payment transactions or processes. Exemplary servers may include, for example, stand-alone and enterprise-class servers operating a server OS such as a MICROSOFT® OS, a UNIX® OS, a LINUX® OS, or other suitable server-based OS. It can be appreciated that the servers illustrated in FIG. 2C may be deployed in other ways and that the operations performed and/or the services provided by such servers may be combined or separated for a given implementation and may be performed by a greater number or fewer number of servers. One or more servers may be operated and/or maintained by the same or different entities.

System 100 may include the image capturing device 110 and an application server 170 in communication over a network 160. Application server 170 may be maintained by a software company that develops and/or offers various applications or services related to image capturing devices. A user 105, such as an operator or owner of the image capturing device, may utilize the image capturing device 110 to download and install applications offered at the application server 170. For example, user 105 may utilize image capturing device 110 to connect to the application server 170 to manage contact information. Although only one application server is shown, a plurality of application servers may be utilized.

Image capturing device 110 and application server 170 may each include one or more processors, memories, and other appropriate components for executing instructions such as program code and/or data stored on one or more computer readable mediums to implement the various applications, data, and steps described herein. For example, such instructions may be stored in one or more computer readable media such as memories or data storage devices internal and/or external to various components of system 100, and/or accessible over network 160. Network 160 may be implemented as a single network or a combination of multiple networks. For example, in various embodiments, network 160 may include the Internet or one or more intranets, landline networks, wireless networks, and/or other appropriate types of networks.

Image capturing device 110 may include hardware and software configured for wired and/or wireless communication over network 160. For example, in one embodiment, the image capturing device 110 may be implemented as a digital camera, a personal computer (PC), a smart phone, wearable device, laptop computer, and/or other types of computing devices capable of transmitting and/or receiving data, such as an iPad™ or an iPhone™ from Apple™.

Image capturing device 110 may include a communication module including hardware and applications for facilitating communications between image capturing device 110 and other communication devices. The communication module may include hardware, such as CPU processor, WiFi card, Bluetooth Module, Ethernet card, and the like, for various wired and wireless communications including communication via LAN, WLAN, PSTN, LTE, 4G, or 3G network, and/or various other wired or wireless networks, including telecommunications, mobile, and cellular phone networks. Image capturing device 110 also may include a social interaction monitoring module including a CPU processor, applications, the communication module, location detection device, and/or various sensors for monitoring user 105 activities at image capturing device 110 and identifying social interactions conducted by user 105. Image capturing device 110 may include a relationship module including the CPU processor, applications, and memory storing relationship database for inferring context and relationship between user 105 and other users. In addition, image capturing device 110 may include a natural language processing module including the CPU processor and applications for processing and analyzing text-based communications to extract context and relationship from the communications.

Image capturing device 110 also may include one or more browser applications 115 which may be used, for example, to provide a convenient interface to permit user 105 to browse information available over network 160. For example, in one embodiment, browser application 115 may be implemented as a web browser configured to view information available over the Internet, such as a user account for online shopping and/or merchant sites for viewing and purchasing goods and services. Image capturing device 110 may also include one or more toolbar applications 120 which may be used, for example, to provide client-side processing for performing desired tasks in response to operations selected by user 105. In one embodiment, toolbar application 120 may display a user interface in connection with browser application 115. Image capturing device 110 also may include other applications to perform functions, such as email, texting, voice and IM applications that allow user 105 to send and receive emails, calls, and texts through network 160, as well as applications that enable the user to communicate, transfer information, or make transactions.

Image capturing device 110 may include one or more user identifiers 130 which may be implemented, for example, as operating system registry entries, cookies associated with browser application 115, identifiers associated with hardware of image capturing device 110, or other appropriate identifiers, such as used for payment/user/device authentication. In one embodiment, user identifier 130 may be used to associate user 105 with a particular application account at the application server 170. A communications application 122, with associated interfaces, enables image capturing device 110 to communicate within system 100.

Image capturing device 110 may include an image capturing app 125 configured to implement various functions related to image capturing. For example, the image capturing app 125 may provide an interface for the user 105 to operate the image capturing device 110 to capture images of various imaging targets. Various settings for image capturing, such as aperture, flash, image capturing mode, color tone, image capturing theme/style, shutter speed, and the like, also may be adjusted by the user 105 through the image capturing app 125.

The image capturing app 125 also may provide functions for outputting attention directing presentations to imaging targets. The content of the attention directing presentations may be selected by the user 105. In some embodiments, the image capturing app 125 may automatically determine the content of the presentation based on presentation preferences of the imaging target, attention status of the imaging target, and the like. A presentation preference profile may be generated for a person to store the person's presentation preferences. The presentation preferences may be determined based on the person's device activities, usage history, social network, and the like.

Image capturing device 110 also may include a location detection device and applications for collecting location data, such as geo-location data via Global Positioning System (GPS), indoor location via Bluetooth Low Energy (BLE) and/or Near-Field Communication (NFC). Image capturing device 110 also may include various sensors for detecting temperature data, altitude data, humidity data, data regarding device movement, ambient sound data, imaging data via a camera, etc. Further, geo-fencing or wireless beacon technology may be used to define a location. Image capturing device 110 may detect signals from devices that implement geo-fencing or wireless beacon technology. These environmental data may be utilized to determine a location or environment in which image capturing device 110 is located.

Application server 170 may be maintained, for example, by a merchant or a software company which may provide applications or software contents that may be installed at consumers' computing devices to provide image capturing functionalities at the computing devices. Application server 170 may also include an account database 120 that includes account information for users having an account on application server 170, such as user 105. Account information may include basic information of users, such as name, address, phone number, email address, and the like. Account information may include image capturing settings and/or preferences of the user. Application server 170 may collect and store various contact information of users. Application server 170 also may include a database that keeps track of presentation preferences among different users.

FIG. 3 is a flowchart showing a process of setting up attention directing functions of an image capturing device 110 according to an embodiment. At step 302, a user may register the image capturing device 110 at application server 170. For example, a user may download an image capturing application from the application server 170 and may set up an account or profile for the image capturing device 110 or for the user. The account may be used for storing and managing information related to image capturing, such as image capturing settings, user preferences, presentation preferences, and the like. In an embodiment, the information may be stored and managed at the image capturing device 110. The account information may include user information, such as name, contact information, picture, birth date, gender, social network account, and other user related information. The account information also may include device information of the image capturing device 110, such as manufacturer, model number, device type, device specification, camera type, and the like.

At step 304, application server 170 and/or the image capturing device 110 may monitor user activities. In particular, image capturing device 110 may include various types of sensors that may detect user activities, such as the user's location, movements, traveling paths, gestures, and the like. For example, image capturing device 110 may include a location detection device, such as a GPS device configured to receive signals from GPS satellites to detect the location of image capturing device 110. In another example, image capturing device 110 may include a Bluetooth communication device configured to detect and receive signals from Bluetooth beacons installed at particular locations. As such, the location of image capturing device 110 may be determined based on detecting the Bluetooth beacons. In still another example, image capturing device 110 may include an NFC device configured to detect other NFC devices located nearby. As such, when another user is located near user 105, the NFC device of image capturing device 110 may detect the NFC device of another user's device. This may allow the system to detect that user 105 is taking pictures of another user.

Further, image capturing device 110 may monitor user 105's operation on image capturing device 110, such as the user's operations to search or browse various products or services, the user's operations to communicate electronically with others, such as emails, text messages, and the like. Image capturing device 110 also may monitor user 105's operations of various applications, such as communication applications, e.g., telephone, email, or messaging applications, payment applications, shopping applications, social networking applications, contact information applications, appointment or scheduling applications, and the like. Image capturing device 110 may monitor information entered or input by user 105 and information requested by user 105. In an embodiment, image capturing device 110 may detect and monitor nearby devices of other users via Bluetooth Low Energy (BLE) or Near Field Communication (NFC) channels. For example, image capturing device 110 may be a wearable device, such as a smart watch. When user 105 is taking a picture of a friend of user 105, the system may detect via user 105's smart watch and the friend's mobile device that they are both in the same location.

At step 306, application server 170 and/or image capturing device 110 may identify potential imaging targets and their respective presentation preferences. Potential imaging targets may be identified based on social interactions with other users from the monitored user activities. Social interactions may include email messages, text messages, phone calls, social related appointments, meeting appointments, in-person meetings, telephone conferences, video conferences, and any other social related interactions between user 105 and others. The social interactions also may include interactions of users on social networking sites or apps, such as FACEBOOK, INSTAGRAM, VINE, TWITTER, VIDDY, PINTEREST, and any social networking services. Other users with whom the user 105 interacts may be identified as potential imaging targets, as the user 105 likely may take pictures of these related persons. User activities, such as setting an appointment on a calendar, scheduling a meeting, travel or location of user 105 relative to other users, also may be used to identify potential imaging targets. In an embodiment, the system may allow user 105 to input persons as potential imaging targets.

In an embodiment, previously captured images may be analyzed to identify potential imaging targets. For example, if the user 105 frequently captures images and/or video of a family member, the family member may be identified from the images and/or videos by the system, such as via facial and/or voice recognition, and may be designated as a potential imaging target.

The system may establish a presentation preference profile for each of the imaging targets. The presentation preference profile may store personal and preference information of the imaging target. For example, the presentation preference profile may store an imaging target's (a person's) name, online user name, age, gender, demography, relationship with the user, facial features, voice patterns, online activity history, device usage, location history, transaction history, presentation preferences, and other information that may be used to identify the imaging target and to determine presentation preferences of the imaging target.

The system may determine presentation preferences of the potential imaging targets based on various factors and sources. For example, a presentation preference profile may be established for each person. The presentation preference profile may store various information defining a person's presentation preference, such as what type of presentation is likely to attract and/or retain a person's attention.

A person's presentation preferences may be determined based on the person's age, gender, demography, interests, affiliation, and the like. The presentation preferences may be determined based on a person's usage history, such as browsing history, payment history, transaction history, travel history, and the like. For example, based on a person's device usage, the person's favorite TV show, favorite sport or sports team, music, character, and the like may be determined and used to determine the person's presentation preference. In an embodiment, the presentation preferences may be determined by crowd sourcing, such as based on other users who are similar to the person in age, gender, demography, interest, and the like.

For example, a three-year-old boy may have cartoon characters, zoo animals, and cartoon animations as his presentation preferences. The boy's presentation preferences may be determined based on the boy's video watching history or game playing history on a smart device. The boy's presentation preferences also may be determined based on presentation preferences of other similar three-year-old boys. In another example, a pet, such as a dog or a cat, may have certain types of noises, lighting patterns, images, and/or any combination thereof as the preferred presentation.

In an embodiment, the system may determine presentation preferences by trial and error. For example, the system may output a random or candidate presentation to an imaging target and detect the response of the imaging target, such as whether the imaging target's attention is directed to or remains with the image capturing device 110. This can be done by tracking the head and/or eye movements of the imaging target. As such, the system may select presentations that can better attract and/or retain the attention of a particular imaging target. The selected presentation or similar presentations may be designated in the presentation preference profile for the imaging target.
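
As a non-limiting illustration, the trial-and-error selection described above may be sketched as follows. The class name `TrialAndErrorSelector`, the `explore_rate` parameter, and the success-rate bookkeeping are assumptions introduced for this sketch and are not part of the disclosure; the disclosure only states that a presentation is tried and the response of the imaging target is detected.

```python
import random
from collections import defaultdict


class TrialAndErrorSelector:
    """Hypothetical helper: tries presentations and remembers which ones worked."""

    def __init__(self, presentations, explore_rate=0.2, rng=None):
        self.presentations = list(presentations)
        self.explore_rate = explore_rate      # assumed fraction of random trials
        self.rng = rng or random.Random()
        self.shown = defaultdict(int)         # times each presentation was output
        self.attracted = defaultdict(int)     # times attention was gained or kept

    def success_rate(self, name):
        # Presentations never shown are treated as having a rate of 0.0.
        if self.shown[name] == 0:
            return 0.0
        return self.attracted[name] / self.shown[name]

    def choose(self):
        # Occasionally try a random presentation, otherwise use the best so far.
        if self.rng.random() < self.explore_rate:
            return self.rng.choice(self.presentations)
        return max(self.presentations, key=self.success_rate)

    def record(self, name, attention_gained):
        # attention_gained would come from head and/or eye tracking.
        self.shown[name] += 1
        if attention_gained:
            self.attracted[name] += 1
```

In such a sketch, the presentation with the highest recorded success rate could be the one designated in the presentation preference profile.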

In another embodiment, the user (e.g., owner or operator of the image capturing device 110) may set or edit/revise presentation preferences. For example, for each identified potential imaging target, the user may designate one or more presentation preferences, which can be further defined by the type of reaction or expression desired from the imaging target. For example, the user may select and/or add one or more specific images, video clips, songs, recordings, etc. for laughing, serious (e.g., something that captures the attention intently but does not elicit any specific reaction), smiling, surprise (e.g., loud sounds), etc. corresponding to an image of a specific imaging target. In one example, the user taps on an image of the user's 4-year-old son and sees an option to add or revise presentation preferences. Once the option is selected, the user can download specific presentations for the son, which can include options to select specific categories of expression. In another example, the user taps on an image of the user's 2-year-old girl or the user's dog and uploads or saves specific presentation preferences for each. In another embodiment, a plurality of presentation preferences may be stored or associated with a specific imaging target for different conditions, such as expression desired, time of day, location, etc. This allows the user to generally associate preferences and then have a limited list to choose from when deciding to capture an image of the target. Even more broadly, the user may store presentation preferences generally, such that imaging targets are not associated with their own preferences. In this way, the user can select a desired presentation on a case-by-case (or capture-by-capture) basis.

At step 308, application server 170 and/or image capturing device 110 may store and update the presentation preference profiles of various imaging targets including individual persons or animals. In particular, the presentation preference profile of an imaging target may continuously be updated to reflect recent interests and popular trends. For example, popular trends in music, movies, TV shows, media and the like may be used to update and refresh the presentation preference profile. Thus, the presentations may be refreshed to better attract and/or retain the attention of the imaging target. Updates can also be made based on how many times a specific presentation is shown to a target, such as a little child. This may be needed, as the presentation may no longer attract the child's attention due to the familiarity and possible lack of interest from repeated use.
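
By way of example only, an update based on how many times a presentation has been shown may be sketched as a simple familiarity discount. The exponential `decay` factor and the profile layout (a mapping from a presentation name to a base weight and a shown count) are assumptions made for this sketch; the disclosure does not specify a particular formula.

```python
def refreshed_weight(base_weight, times_shown, decay=0.8):
    """Discount a presentation's weight each time it has already been shown.

    The exponential decay is an assumed model of familiarity, not a
    formula taken from the disclosure.
    """
    return base_weight * (decay ** times_shown)


def rank_presentations(profile):
    """Order presentation names from most to least promising.

    `profile` maps a presentation name to (base_weight, times_shown).
    """
    return sorted(
        profile,
        key=lambda name: refreshed_weight(*profile[name]),
        reverse=True,
    )
```

Under these assumptions, a favorite cartoon shown five times may rank below a less preferred but unseen presentation.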

By using the above process 300, potential imaging targets may be identified and presentation preference profiles may be established for the imaging targets. Further, presentation preferences of various imaging targets may be determined and stored in the presentation preference profiles. Each image capturing device may be associated with many different imaging targets or potential imaging targets, such as friends and family members of the user of the image capturing device. Presentation preference profiles may be established for each imaging target to store each imaging target's presentation preferences.

FIG. 4 is a flowchart showing a process for capturing images with attention directing functions according to one embodiment. At step 402, application server 170 or image capturing device 110 may receive an indication of an image capturing operation. For example, the user 105 may operate the image capturing device 110 to begin an image capturing process. Depending on the type of image capturing device 110, this can be done by turning on the image capturing device 110, activating an image capturing application on the image capturing device 110, operating an image capturing button, performing a gesture on a touch screen, or the like.

At step 404, the image capturing device 110 may detect imaging targets. The user may operate the image capturing device 110 and point the camera 215 of the image capturing device 110 in a direction of the imaging target. The camera 215 may capture images of the imaging target. In particular, the images of the imaging target may be analyzed to identify the person/persons in the imaging target. For example, the identities of the person or persons in the imaging target may be determined using facial or voice recognition techniques.

In the facial recognition process, the system may first identify areas in the images that resemble faces. The system then may extract landmarks or features of the faces in the images, such as the relative position, size and/or shape of the eyes, nose, cheekbones, and jaw. The facial recognition may include a geometric approach and/or a photometric approach. The geometric approach may utilize the distinguishing features of faces for identification. The photometric approach may utilize statistical data that distill facial images into values and compare the values with templates to eliminate variances.

In the voice recognition process, the system may detect the voice of the person or persons in the imaging target. The detected voice may be analyzed (and filtered by frequency filters) for patterns and frequencies which may be matched against the voice profiles of known persons. Thus, the person or persons in the imaging target may be identified by their voices. The user may also say the name of the imaging target(s), which the system can use to match with names of stored imaging targets. For example, the user may say “Bob Smith” and “Jane Green” to identify the two imaging targets.

The image capturing device 110 may be associated with a plurality of presentation preference profiles of various imaging targets, such as family members and/or friends related to the user of the image capturing device 110. The system may compare the facial images detected with those of the presentation preference profiles associated with the image capturing device 110. Thus, the system may first attempt to identify any family members or friends of the user in the imaging target.

If the imaging target includes faces that do not match those in the presentation preference profiles associated with the image capturing device 110, the system may attempt to determine the age and/or gender of the person or persons in the imaging target. The age and/or gender of a person may be determined based on unique facial features statistically attributed to a certain age and/or gender, such as size, hair style, relative positions of eyes, nose, hair, skin textures, and the like. Thus, the age and/or gender of the persons in the imaging target may be determined or estimated.

The imaging target may include faces of animal pets, such as cats or dogs. The system also may analyze the facial images of pets to identify a family pet, if a presentation preference profile of a family pet is established. This can be done by analyzing the facial features of the pet, including relative location, size, distances of eyes, nose, mouth, fur color, body size, body shape, and the like. If the detected animal does not match any of the presentation preference profiles associated with the image capturing device, the system may determine the type of animal, such as a cat, a dog, or the like.

In an embodiment, the system may detect the attention status of the imaging target. If the imaging target includes multiple persons or animals, the system may detect the attention status of each of the persons or animals in the group. The attention status may be detected by tracking the eye movement, body movement, facing direction, and the like of the person or animal. An attention score may be assigned to a person to indicate how much attention the person is directing to the image capturing device 110. The more attention a person is paying to the image capturing device 110, the higher the attention score may be. For example, if the person is facing toward the image capturing device 110 and is looking directly at the camera, the person may be assigned a higher attention score. In contrast, if a person is looking away from the image capturing device 110 or if the face of the person is covered or is hidden behind another person or object, the person may be assigned a lower attention score. The attention score of the person may be continuously updated in real time as the system detects and analyzes the person's movements and positions.
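
As one possible illustration, an attention score of the kind described above may be computed from per-frame observations. The `Observation` fields, the 0.4/0.4/0.2 weights, and the smoothing factor `alpha` are hypothetical values chosen for this sketch; the disclosure states only that facing and looking at the camera raises the score and that looking away or a covered face lowers it.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    """Hypothetical per-frame output of a face and gaze tracker."""
    facing_camera: bool
    gaze_on_camera: bool
    face_visible_fraction: float  # 0.0 (fully covered) to 1.0 (fully visible)


def attention_score(obs: Observation) -> float:
    """Return a score in [0, 1]; higher means more attention on the camera."""
    visible = min(max(obs.face_visible_fraction, 0.0), 1.0)
    score = 0.2 * visible          # assumed weight for an unobstructed face
    if obs.facing_camera:
        score += 0.4               # assumed weight for facing direction
    if obs.gaze_on_camera:
        score += 0.4               # assumed weight for eye contact
    return score


def update_score(previous: float, obs: Observation, alpha: float = 0.5) -> float:
    """Blend the newest observation into a running score for real-time updates."""
    return alpha * attention_score(obs) + (1.0 - alpha) * previous
```

The running update is one simple way to keep the score from jumping on every blink or brief glance away.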

At step 406, the image capturing device 110 or the application server 170 may determine attention directing presentations based on the imaging target. If a person (or pet) in the imaging target is identified to have a presentation preference profile with the system, the system may determine the content of the presentation based on the preferences stored in the presentation preference profile to attract and/or retain the person's attention before and/or during an image capturing process. The presentation preference profile may indicate specific types of sound, images, music, movies, sports teams, themes, characters, animations, patterns, or the like that are effective at attracting and/or retaining the person's attention. For example, a five-year-old daughter of the user may have a presentation preference profile storing the girl's preference for certain cartoon characters, TV shows, music, or the like.

If a person (or pet) is not identified to have a presentation preference profile with the system, the system may determine the content of the presentation based on the estimated age and/or gender of the person or based on the type of animal of the pet. The system may generate or select types of presentation that can effectively attract and/or retain attention of persons of similar age and/or gender. Thus, for persons without presentation preference profiles, the content of presentation may be determined or selected based on crowd sourcing.
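
The profile lookup with a crowd-sourced fallback described above may be sketched, purely for illustration, as follows. The `CROWD_DEFAULTS` table, the age bands, and the dictionary layout of `target` and `profiles` are all hypothetical and are not taken from the disclosure.

```python
# Hypothetical crowd-sourced defaults keyed by (kind, group).
CROWD_DEFAULTS = {
    ("person", "child"): ["cartoon animation", "animal sounds"],
    ("person", "adult"): ["visual countdown"],
    ("animal", "dog"): ["squeak sound"],
    ("animal", "cat"): ["moving light pattern"],
}


def age_band(estimated_age):
    # Assumed two-band split; a real system could use finer bands.
    return "child" if estimated_age < 13 else "adult"


def determine_content(target, profiles):
    """Pick presentation content for one detected person or pet.

    `target` is a dict with "kind" ("person" or "animal"), an optional
    "identity", and either "estimated_age" or "animal_type".
    `profiles` maps an identity to a stored presentation preference profile.
    """
    profile = profiles.get(target.get("identity"))
    if profile:
        return profile["preferences"]          # identified: use the stored profile
    if target["kind"] == "animal":
        key = ("animal", target.get("animal_type"))
    else:
        key = ("person", age_band(target["estimated_age"]))
    return CROWD_DEFAULTS.get(key, ["visual countdown"])
```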

In an embodiment, the presentation may be determined based on the imaging target's attention status. For example, if the person of the imaging target is looking at and paying attention to the image capturing device 110, the system may select or determine a presentation with moderate intensity (moderate noise or color contrast) to retain the person's attention. If the person of the imaging target is looking away from or is not paying attention to the image capturing device 110, the system may select or determine a presentation with higher intensity (louder sound or noise, flashier display) to attract or direct the person's attention toward the image capturing device 110. For example, if a child is already looking at the camera, the system may select a short cartoon video as the presentation. If the child is looking away from the camera, the system may select a video starting with a loud noise (e.g., fireworks) or the like to attract the child's attention to the image capturing device 110.
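
For illustration only, the intensity selection described above may be reduced to a threshold on an attention score. The threshold of 0.6 and the volume and contrast values are assumptions for this sketch.

```python
def choose_intensity(attention_score, attentive_threshold=0.6):
    """Map an attention score in [0, 1] to assumed presentation settings.

    An attentive target gets a moderate presentation to retain attention;
    an inattentive target gets a louder, flashier one to attract it.
    """
    if attention_score >= attentive_threshold:
        return {"level": "moderate", "volume": 0.4, "contrast": 0.5}
    return {"level": "high", "volume": 0.9, "contrast": 1.0}
```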

In an embodiment, the presentation may be determined based on context, such as location, nearby events/attractions, time of the day, season, and the like. For example, if the image capturing process is occurring near a zoo, a presentation related to animals, such as animal images and sounds, may be selected. If the images are taken close to lunch time, food related presentations may be used. If the images are taken during the Christmas season, music or videos related to the winter holiday may be selected.

The imaging target may include multiple persons. The system may determine a presentation based on the combined preferences of multiple persons. For example, the system may identify multiple persons in the imaging target and may determine the presentation preferences for each of the persons in the group. The system may select or determine a presentation based on the shared preferences of the group. For example, the system may determine a type of presentation that is preferred by most of the persons (majority) in the group.

In an embodiment, the system may determine a presentation based on the attention status of multiple persons. In particular, the system may determine a presentation more tailored to the persons in the group who are not paying attention to the image capturing device 110. Thus, the presentation may more effectively attract the attention of those who are not looking at the camera. For example, the preferences of persons who are not looking at the camera may be prioritized or weighted more for determining a presentation.

In an embodiment, the preferences of multiple persons may be prioritized based on various factors. For example, the preferences of multiple persons may be prioritized based on age. Presentation preferences of younger persons, such as young children, may be prioritized over preferences of older persons. This is because young children may require more help in directing and/or retaining their attention at the camera. Presentation preferences of multiple persons also may be prioritized based on the positions of persons in the imaging target. For example, preferences of persons positioned farther away from the image capturing device 110 may be prioritized over those of persons positioned closer to the image capturing device 110. In another example, preferences of persons positioned on the side may be prioritized over those of persons positioned near the center. This is because persons positioned farther away or on the side may require more help in directing their attention to the camera.
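
One way to combine the group-level factors discussed above (majority preference, attention status, age, and position) is a weighted vote, sketched below. The specific weights, the age cutoff of 6, and the field names are hypothetical and serve only to show how prioritization could change the outcome relative to a simple majority.

```python
from collections import defaultdict


def person_weight(person):
    """Assumed weighting: inattentive, younger, or off-center persons count more."""
    weight = 1.0
    if not person["attentive"]:
        weight += 1.0
    if person["age"] < 6:
        weight += 0.5
    if person["position"] == "side":
        weight += 0.25
    return weight


def group_presentation(people):
    """Return the presentation type with the highest weighted vote total.

    Raises ValueError if no person has any stored preference.
    Ties are broken alphabetically so the result is deterministic.
    """
    totals = defaultdict(float)
    for person in people:
        weight = person_weight(person)
        for preference in person["preferences"]:
            totals[preference] += weight
    return max(sorted(totals), key=totals.get)
```

With equal weights this reduces to the majority rule; with the assumed weights, a single inattentive toddler on the side can outweigh two attentive adults.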

In an embodiment, the system may continuously modify and/or update the content of the presentation in real time based on the detected movement and/or attention status of the person or persons in the imaging target. For example, the system may monitor and keep track of the eyes and/or faces of multiple persons in the imaging target and may adjust or modify the content, intensity, speed, color contrast, and sound volume of the presentation to more effectively attract and direct the attention of the persons in real time.

The system may select from various types of presentations based on the imaging target. The presentations may include sound patterns, lighting patterns, videos, music, sound clips of animals, sound clips of vehicles, and the like. Some presentations may also include a visual and/or voice countdown to the image capturing moment. Some presentations may include animations that move closer to the camera as the image capturing moment approaches. Some presentations may increase in intensity (visual and/or audio) as the image capturing moment approaches. The contents of the presentations may be stored with the image capturing device 110. In some embodiments, the contents of the presentations may be stored in a presentation database at the application server 170. Once selected, the content of a presentation may be communicated from the application server 170 to the image capturing device 110.
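
As a minimal sketch of a presentation that increases in intensity as the image capturing moment approaches, a linear ramp may be used. The linear shape and the `start` and `peak` values are assumptions; the disclosure does not prescribe how the intensity grows.

```python
def ramp_intensity(seconds_remaining, countdown_seconds, start=0.3, peak=1.0):
    """Return an intensity that rises from `start` to `peak` during a countdown.

    `seconds_remaining` is the time left until the image capturing moment.
    """
    if countdown_seconds <= 0:
        return peak
    remaining_fraction = min(max(seconds_remaining / countdown_seconds, 0.0), 1.0)
    progress = 1.0 - remaining_fraction
    return start + (peak - start) * progress
```

The same value could drive sound volume, display brightness, or the apparent distance of an animation moving toward the camera.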

In another embodiment, the user 105 simply selects one or more presentations through a user interface of the image capturing device 110. For example, when the user 105 opens a camera app or otherwise indicates a desire to capture an image, the user may be presented with a list of stored presentation preferences as part of the camera view, such as through an overlay. The user 105 may then tap or otherwise select a desired presentation. For example, the list may include several different songs, videos, audio recordings, images, and the like that the user can select for the particular image capturing session. Advantageously, this simplifies the process and enables more control by the user.

At step 408, the image capturing device 110 may present or output the attention directing presentations to the imaging target. In particular, the user 105 may point the camera toward the imaging target and may operate the image capturing device 110 to capture images (videos) of the imaging target. In response, the image capturing device 110 may begin to present the selected presentations to the imaging target to attract and/or retain the attention of the imaging target at the camera. The presentation may be output to the imaging target before and/or during the image capturing process. For example, the presentation may be displayed or output several seconds before the image capturing moment. In the case of a video capturing process, the presentation may be displayed and/or output before and during the video capturing period. In different embodiments, the presentation may repeat or a next presentation may be displayed. Settings for repeat, random, or selected sequence may be set by the user.

At step 410, the image capturing device 110 may capture images of the imaging target. In particular, the image capturing device 110 may capture images along with the attention directing presentations. In some embodiments, the image capturing device 110 may display or output the presentation to the imaging target and may determine the image capturing moment (or period if video) based on the response of the imaging target. For example, the image capturing device 110 may output the presentation, may monitor the attention status of the person or persons in the imaging target, and may automatically capture images (videos) when the person is looking toward the camera or when the persons in the imaging target are all looking at the camera. In another example, the image capturing device 110 may automatically capture images (videos) when the person has a certain facial expression, such as a smile, a laugh, and the like. This may ensure that the images (videos) are taken when the person or persons are looking at the camera or when the person or persons have a certain facial expression desired by the user.
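
The automatic capture condition described above may be expressed, as a sketch under stated assumptions, as a check over the current status of every detected person. The status dictionary keys and the optional `required_expression` argument are hypothetical names.

```python
def should_capture(statuses, required_expression=None):
    """Return True when every person is looking at the camera.

    `statuses` is a list of dicts with "looking_at_camera" (bool) and an
    optional "expression" (e.g., "smile"). If `required_expression` is
    given, every person must also show that expression.
    """
    if not statuses:
        return False  # nobody detected, nothing to capture
    for status in statuses:
        if not status["looking_at_camera"]:
            return False
        if required_expression and status.get("expression") != required_expression:
            return False
    return True
```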

In some embodiments, the image capturing device 110 may analyze the captured images and may select images in which the imaging target is paying attention to the camera. For example, the image capturing device 110 may capture a plurality of images consecutively (burst mode) and may select images in which the person or persons are all looking at the camera. The images in which the person or persons are not all looking at the camera may be deleted.
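
The burst-mode selection described above may be illustrated with a simple filter. The frame layout (a dict with a "faces" list whose entries carry a "looking_at_camera" flag) is an assumed output of a face analysis step and is not specified in the disclosure.

```python
def select_burst_frames(frames):
    """Keep only frames in which every detected face is looking at the camera.

    Frames with no detected faces are dropped as well.
    """
    kept = []
    for frame in frames:
        faces = frame["faces"]
        if faces and all(face["looking_at_camera"] for face in faces):
            kept.append(frame)
    return kept
```

Frames not returned by the filter correspond to the images that may be deleted.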

Accordingly, the image capturing device 110 may include attention directing functions to direct and/or retain the attention of the imaging target. The content of the presentation may be customized based on the imaging target and the attention status of the persons in the imaging target to more effectively direct, attract, and/or retain the attention of the person or persons in the imaging target.

The above processes 200 and 300 may be implemented at the image capturing device 110. In an embodiment, the above processes 200 and 300 may be implemented at the application server 170. In still another embodiment, the above processes 200 and 300 may be implemented by the image capturing device 110 and the application server 170 in coordination with each other. Note that the various steps described herein may be performed in a different order, combined, and/or omitted as desired.

The foregoing disclosure is not intended to limit the present disclosure to the precise forms or particular fields of use disclosed. As such, it is contemplated that various alternate embodiments and/or modifications to the present disclosure, whether explicitly described or implied herein, are possible in light of the disclosure. Having thus described embodiments of the present disclosure, persons of ordinary skill in the art will recognize that changes may be made in form and detail without departing from the scope of the present disclosure. Thus, the present disclosure is limited only by the claims.

What is claimed is:
 1. An image capturing device comprising: a camera configured to capture an image; a presentation device configured to output a presentation; a non-transitory hardware memory storing an application for performing attention directing functions; and one or more hardware processors coupled to the non-transitory hardware memory and configured to read instructions from the non-transitory memory to cause the image capturing device to perform operations comprising: receiving a user operation to capture an image of an imaging target; determining, based at least in part on the imaging target, a presentation to communicate to the imaging target; outputting, by the presentation device, the presentation to the imaging target before and/or during an image capturing process of the camera; and capturing, by the camera, an image of the imaging target.
 2. The image capturing device of claim 1, wherein the camera and the presentation device are disposed on a same portion of the image capturing device facing the imaging target when the image capturing device is capturing an image of the imaging target.
 3. The image capturing device of claim 1, wherein the presentation comprises a video presentation directing an attention of the imaging target toward the camera.
 4. The image capturing device of claim 1, wherein the presentation comprises an audio presentation directing an attention of the imaging target toward the camera.
 5. The image capturing device of claim 1, wherein the presentation increases in intensity as a moment of image capturing approaches.
 6. The image capturing device of claim 3, wherein the video presentation comprises an animation moving towards the camera as a moment of image capturing approaches.
 7. The image capturing device of claim 1, wherein the one or more hardware processors are further configured to cause the image capturing device to perform operations comprising: identifying a person in the imaging target; and determining a content of the presentation based on the person in the imaging target.
 8. The image capturing device of claim 7, wherein the content of the presentation is determined based on a presentation preference profile of the person including one or more of the person's age, gender, demographic, device usage history, and social network.
 9. The image capturing device of claim 7, wherein the content of the presentation is determined based on preferences of other users similar to the person.
 10. The image capturing device of claim 1, wherein the one or more hardware processors are further configured to cause the image capturing device to perform operations comprising: monitoring an attention status of the imaging target; and determining a content of the presentation based on attention status of the imaging target.
 11. The image capturing device of claim 10, wherein the content of the presentation is modified based on changes to the attention status of the imaging target in real time.
 12. A method for image capturing, the method comprising: receiving, by an image capturing device, a user operation to capture an image of an imaging target; determining, based at least in part on the imaging target, a presentation to communicate to the imaging target; outputting, by a presentation device of the image capturing device, the presentation to the imaging target before and/or during an image capturing process; and capturing, by a camera of the image capturing device, an image of the imaging target during the image capturing process.
 13. The method of claim 12, further comprising: identifying a person in the imaging target; and determining a content of the presentation based on the person in the imaging target.
 14. The method of claim 12, further comprising: monitoring an attention status of the imaging target; and determining a content of the presentation based on the attention status of the imaging target.
 15. The method of claim 13, further comprising: monitoring device activities of the person; and determining a presentation preference profile of the person based on the device activities of the person.
 16. The method of claim 13, further comprising determining a presentation preference profile of the person based on one or more of age, gender, demography, and social network of the person.
 17. The method of claim 12, further comprising: identifying a plurality of persons in the imaging target; and determining a content of the presentation based on combined presentation preferences of the plurality of persons.
 18. The method of claim 17, wherein presentation preferences of the plurality of persons are prioritized based on respective persons' attention status.
 19. The method of claim 12, further comprising determining a content of the presentation based on preferences designated by the user.
 20. A non-transitory machine-readable medium having stored thereon machine-readable instructions executable to cause a machine to perform operations comprising: receiving, by an image capturing device, a user operation to capture an image of an imaging target; determining, based at least in part on the imaging target, a presentation to communicate to the imaging target; outputting, by a presentation device of the image capturing device, the presentation to the imaging target before and/or during an image capturing process; and capturing, by a camera of the image capturing device, an image of the imaging target during the image capturing process.